Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 2217 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 363.9 KiB |
| Average record size in memory | 168.1 B |
Variable types
| Numeric | 17 |
|---|---|
| DateTime | 1 |
| Categorical | 3 |
bathrooms is highly overall correlated with bedrooms and 6 other fields | High correlation |
bedrooms is highly overall correlated with bathrooms and 2 other fields | High correlation |
floors is highly overall correlated with bathrooms and 3 other fields | High correlation |
grade is highly overall correlated with bathrooms and 6 other fields | High correlation |
long is highly overall correlated with zipcode | High correlation |
price is highly overall correlated with grade and 3 other fields | High correlation |
sqft_above is highly overall correlated with bathrooms and 6 other fields | High correlation |
sqft_living is highly overall correlated with bathrooms and 5 other fields | High correlation |
sqft_living15 is highly overall correlated with bathrooms and 4 other fields | High correlation |
sqft_lot is highly overall correlated with sqft_lot15 | High correlation |
sqft_lot15 is highly overall correlated with sqft_lot | High correlation |
view is highly overall correlated with waterfront | High correlation |
waterfront is highly overall correlated with view | High correlation |
yr_built is highly overall correlated with bathrooms and 2 other fields | High correlation |
zipcode is highly overall correlated with long | High correlation |
waterfront is highly imbalanced (94.8%) | Imbalance |
view is highly imbalanced (72.9%) | Imbalance |
id has unique values | Unique |
sqft_basement has 1346 (60.7%) zeros | Zeros |
yr_renovated has 2121 (95.7%) zeros | Zeros |
Reproduction
| Analysis started | 2024-01-19 15:00:49.911113 |
|---|---|
| Analysis finished | 2024-01-19 15:01:36.678717 |
| Duration | 46.77 seconds |
| Software version | ydata-profiling vv4.6.4 |
| Download configuration | config.json |
id
Real number (ℝ)
UNIQUE 
| Distinct | 2217 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.6280922 × 109 |
| Minimum | 1000102 |
|---|---|
| Maximum | 9.8393012 × 109 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.4 KiB |
Quantile statistics
| Minimum | 1000102 |
|---|---|
| 5-th percentile | 5.1714042 × 108 |
| Q1 | 2.1177001 × 109 |
| median | 3.9050809 × 109 |
| Q3 | 7.4629 × 109 |
| 95-th percentile | 9.3248804 × 109 |
| Maximum | 9.8393012 × 109 |
| Range | 9.8383011 × 109 |
| Interquartile range (IQR) | 5.3452 × 109 |
Descriptive statistics
| Standard deviation | 2.9104694 × 109 |
|---|---|
| Coefficient of variation (CV) | 0.62887022 |
| Kurtosis | -1.2980905 |
| Mean | 4.6280922 × 109 |
| Median Absolute Deviation (MAD) | 2.4721806 × 109 |
| Skewness | 0.22543155 |
| Sum | 1.026048 × 1013 |
| Variance | 8.470832 × 1018 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3793500160 | 1 | < 0.1% |
| 8651430870 | 1 | < 0.1% |
| 9277200180 | 1 | < 0.1% |
| 1139000072 | 1 | < 0.1% |
| 4077800455 | 1 | < 0.1% |
| 123039207 | 1 | < 0.1% |
| 9269200120 | 1 | < 0.1% |
| 3345100286 | 1 | < 0.1% |
| 1954700365 | 1 | < 0.1% |
| 6979970150 | 1 | < 0.1% |
| Other values (2207) | 2207 |
| Value | Count | Frequency (%) |
| 1000102 | 1 | |
| 3800008 | 1 | |
| 7200080 | 1 | |
| 7200179 | 1 | |
| 11501330 | 1 | |
| 11510310 | 1 | |
| 11900140 | 1 | |
| 13002460 | 1 | |
| 16000397 | 1 | |
| 16000545 | 1 |
| Value | Count | Frequency (%) |
| 9839301165 | 1 | |
| 9834201470 | 1 | |
| 9834201215 | 1 | |
| 9828702902 | 1 | |
| 9828202325 | 1 | |
| 9828202255 | 1 | |
| 9828201725 | 1 | |
| 9828201361 | 1 | |
| 9828201020 | 1 | |
| 9828200187 | 1 |
date
Date
| Distinct | 300 |
|---|---|
| Distinct (%) | 13.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.4 KiB |
| Minimum | 2014-05-02 00:00:00 |
|---|---|
| Maximum | 2015-05-14 00:00:00 |
price
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 975 |
|---|---|
| Distinct (%) | 44.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 538724.24 |
| Minimum | 83000 |
|---|---|
| Maximum | 3850000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.4 KiB |
Quantile statistics
| Minimum | 83000 |
|---|---|
| 5-th percentile | 209720 |
| Q1 | 320000 |
| median | 450000 |
| Q3 | 635000 |
| 95-th percentile | 1182000 |
| Maximum | 3850000 |
| Range | 3767000 |
| Interquartile range (IQR) | 315000 |
Descriptive statistics
| Standard deviation | 358635.06 |
|---|---|
| Coefficient of variation (CV) | 0.66571176 |
| Kurtosis | 15.935121 |
| Mean | 538724.24 |
| Median Absolute Deviation (MAD) | 149950 |
| Skewness | 3.1460339 |
| Sum | 1.1943516 × 109 |
| Variance | 1.2861911 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 500000 | 20 | 0.9% |
| 400000 | 19 | 0.9% |
| 600000 | 18 | 0.8% |
| 550000 | 18 | 0.8% |
| 425000 | 18 | 0.8% |
| 350000 | 18 | 0.8% |
| 435000 | 17 | 0.8% |
| 450000 | 17 | 0.8% |
| 325000 | 17 | 0.8% |
| 280000 | 16 | 0.7% |
| Other values (965) | 2039 |
| Value | Count | Frequency (%) |
| 83000 | 1 | |
| 89000 | 1 | |
| 92000 | 1 | |
| 95000 | 1 | |
| 109500 | 1 | |
| 110000 | 2 | |
| 114975 | 1 | |
| 119500 | 1 | |
| 123000 | 1 | |
| 130000 | 2 |
| Value | Count | Frequency (%) |
| 3850000 | 1 | |
| 3420000 | 1 | |
| 3200000 | 1 | |
| 3170000 | 1 | |
| 3120000 | 1 | |
| 3000000 | 1 | |
| 2950000 | 1 | |
| 2890000 | 1 | |
| 2720000 | 1 | |
| 2700000 | 1 |
bedrooms
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.353631 |
| Minimum | 1 |
|---|---|
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.86726292 |
|---|---|
| Coefficient of variation (CV) | 0.25860416 |
| Kurtosis | 1.2340465 |
| Mean | 3.353631 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.40140298 |
| Sum | 7435 |
| Variance | 0.75214498 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 1033 | |
| 4 | 711 | |
| 2 | 274 | 12.4% |
| 5 | 156 | 7.0% |
| 1 | 20 | 0.9% |
| 6 | 19 | 0.9% |
| 7 | 3 | 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 20 | 0.9% |
| 2 | 274 | 12.4% |
| 3 | 1033 | |
| 4 | 711 | |
| 5 | 156 | 7.0% |
| 6 | 19 | 0.9% |
| 7 | 3 | 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 9 | 1 | < 0.1% |
| 7 | 3 | 0.1% |
| 6 | 19 | 0.9% |
| 5 | 156 | 7.0% |
| 4 | 711 | |
| 3 | 1033 | |
| 2 | 274 | 12.4% |
| 1 | 20 | 0.9% |
bathrooms
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 19 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.0990077 |
| Minimum | 0.5 |
|---|---|
| Maximum | 6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.4 KiB |
Quantile statistics
| Minimum | 0.5 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1.5 |
| median | 2.25 |
| Q3 | 2.5 |
| 95-th percentile | 3.5 |
| Maximum | 6 |
| Range | 5.5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.75756349 |
|---|---|
| Coefficient of variation (CV) | 0.36091507 |
| Kurtosis | 0.55405408 |
| Mean | 2.0990077 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | 0.44958572 |
| Sum | 4653.5 |
| Variance | 0.57390244 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2.5 | 541 | |
| 1 | 388 | |
| 1.75 | 345 | |
| 2 | 203 | 9.2% |
| 2.25 | 196 | 8.8% |
| 1.5 | 158 | 7.1% |
| 2.75 | 119 | 5.4% |
| 3.5 | 75 | 3.4% |
| 3 | 74 | 3.3% |
| 3.25 | 51 | 2.3% |
| Other values (9) | 67 | 3.0% |
| Value | Count | Frequency (%) |
| 0.5 | 1 | < 0.1% |
| 0.75 | 9 | 0.4% |
| 1 | 388 | |
| 1.5 | 158 | 7.1% |
| 1.75 | 345 | |
| 2 | 203 | 9.2% |
| 2.25 | 196 | 8.8% |
| 2.5 | 541 | |
| 2.75 | 119 | 5.4% |
| 3 | 74 | 3.3% |
| Value | Count | Frequency (%) |
| 6 | 1 | < 0.1% |
| 5 | 3 | 0.1% |
| 4.75 | 2 | 0.1% |
| 4.5 | 15 | 0.7% |
| 4.25 | 8 | 0.4% |
| 4 | 14 | 0.6% |
| 3.75 | 14 | 0.6% |
| 3.5 | 75 | |
| 3.25 | 51 | |
| 3 | 74 |
sqft_living
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 431 |
|---|---|
| Distinct (%) | 19.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2073.4398 |
| Minimum | 420 |
|---|---|
| Maximum | 7850 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.4 KiB |
Quantile statistics
| Minimum | 420 |
|---|---|
| 5-th percentile | 920 |
| Q1 | 1460 |
| median | 1910 |
| Q3 | 2490 |
| 95-th percentile | 3832 |
| Maximum | 7850 |
| Range | 7430 |
| Interquartile range (IQR) | 1030 |
Descriptive statistics
| Standard deviation | 897.05421 |
|---|---|
| Coefficient of variation (CV) | 0.43264059 |
| Kurtosis | 2.808911 |
| Mean | 2073.4398 |
| Median Absolute Deviation (MAD) | 510 |
| Skewness | 1.2995045 |
| Sum | 4596816 |
| Variance | 804706.25 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2100 | 19 | 0.9% |
| 1560 | 18 | 0.8% |
| 1610 | 18 | 0.8% |
| 1530 | 18 | 0.8% |
| 1540 | 18 | 0.8% |
| 1630 | 17 | 0.8% |
| 1180 | 17 | 0.8% |
| 1010 | 17 | 0.8% |
| 1470 | 17 | 0.8% |
| 1480 | 16 | 0.7% |
| Other values (421) | 2042 |
| Value | Count | Frequency (%) |
| 420 | 1 | < 0.1% |
| 550 | 1 | < 0.1% |
| 560 | 1 | < 0.1% |
| 600 | 1 | < 0.1% |
| 620 | 2 | 0.1% |
| 700 | 2 | 0.1% |
| 710 | 2 | 0.1% |
| 720 | 7 | |
| 730 | 3 | |
| 740 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 7850 | 1 | |
| 7120 | 1 | |
| 6380 | 1 | |
| 6085 | 1 | |
| 6050 | 1 | |
| 5840 | 1 | |
| 5810 | 1 | |
| 5780 | 1 | |
| 5770 | 2 | |
| 5710 | 1 |
sqft_lot
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 1627 |
|---|---|
| Distinct (%) | 73.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13554.643 |
| Minimum | 683 |
|---|---|
| Maximum | 435600 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.4 KiB |
Quantile statistics
| Minimum | 683 |
|---|---|
| 5-th percentile | 1984.8 |
| Q1 | 5000 |
| median | 7526 |
| Q3 | 10464 |
| 95-th percentile | 41207.8 |
| Maximum | 435600 |
| Range | 434917 |
| Interquartile range (IQR) | 5464 |
Descriptive statistics
| Standard deviation | 29606.43 |
|---|---|
| Coefficient of variation (CV) | 2.1842279 |
| Kurtosis | 78.528563 |
| Mean | 13554.643 |
| Median Absolute Deviation (MAD) | 2581 |
| Skewness | 7.9126428 |
| Sum | 30050644 |
| Variance | 8.7654072 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4000 | 32 | 1.4% |
| 5000 | 29 | 1.3% |
| 6000 | 26 | 1.2% |
| 7200 | 24 | 1.1% |
| 4500 | 19 | 0.9% |
| 9600 | 14 | 0.6% |
| 8400 | 12 | 0.5% |
| 7500 | 12 | 0.5% |
| 9000 | 12 | 0.5% |
| 4800 | 12 | 0.5% |
| Other values (1617) | 2025 |
| Value | Count | Frequency (%) |
| 683 | 1 | |
| 745 | 1 | |
| 804 | 1 | |
| 809 | 1 | |
| 812 | 1 | |
| 825 | 1 | |
| 834 | 1 | |
| 844 | 1 | |
| 892 | 1 | |
| 932 | 1 |
| Value | Count | Frequency (%) |
| 435600 | 1 | |
| 403693 | 1 | |
| 384634 | 1 | |
| 360241 | 1 | |
| 344124 | 1 | |
| 313672 | 1 | |
| 250905 | 1 | |
| 231739 | 1 | |
| 231303 | 1 | |
| 218472 | 1 |
floors
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.496166 |
| Minimum | 1 |
|---|---|
| Maximum | 3.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1.5 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 3.5 |
| Range | 2.5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.54355321 |
|---|---|
| Coefficient of variation (CV) | 0.3632974 |
| Kurtosis | -0.36460892 |
| Mean | 1.496166 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | 0.65224001 |
| Sum | 3317 |
| Variance | 0.2954501 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1091 | |
| 2 | 837 | |
| 1.5 | 206 | 9.3% |
| 3 | 67 | 3.0% |
| 2.5 | 14 | 0.6% |
| 3.5 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| 1 | 1091 | |
| 1.5 | 206 | 9.3% |
| 2 | 837 | |
| 2.5 | 14 | 0.6% |
| 3 | 67 | 3.0% |
| 3.5 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| 3.5 | 2 | 0.1% |
| 3 | 67 | 3.0% |
| 2.5 | 14 | 0.6% |
| 2 | 837 | |
| 1.5 | 206 | 9.3% |
| 1 | 1091 |
waterfront
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.4 KiB |
| 0 | |
|---|---|
| 1 | 13 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2217 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2204 | |
| 1 | 13 | 0.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2204 | |
| 1 | 13 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2204 | |
| 1 | 13 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2217 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2204 | |
| 1 | 13 | 0.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2217 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2204 | |
| 1 | 13 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2217 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2204 | |
| 1 | 13 | 0.6% |
view
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.4 KiB |
| 0 | |
|---|---|
| 2 | 92 |
| 3 | 57 |
| 1 | 33 |
| 4 | 29 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2217 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2006 | |
| 2 | 92 | 4.1% |
| 3 | 57 | 2.6% |
| 1 | 33 | 1.5% |
| 4 | 29 | 1.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2006 | |
| 2 | 92 | 4.1% |
| 3 | 57 | 2.6% |
| 1 | 33 | 1.5% |
| 4 | 29 | 1.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2006 | |
| 2 | 92 | 4.1% |
| 3 | 57 | 2.6% |
| 1 | 33 | 1.5% |
| 4 | 29 | 1.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2217 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2006 | |
| 2 | 92 | 4.1% |
| 3 | 57 | 2.6% |
| 1 | 33 | 1.5% |
| 4 | 29 | 1.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2217 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2006 | |
| 2 | 92 | 4.1% |
| 3 | 57 | 2.6% |
| 1 | 33 | 1.5% |
| 4 | 29 | 1.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2217 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2006 | |
| 2 | 92 | 4.1% |
| 3 | 57 | 2.6% |
| 1 | 33 | 1.5% |
| 4 | 29 | 1.3% |
condition
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.4 KiB |
| 3 | |
|---|---|
| 4 | |
| 5 | |
| 2 | 23 |
| 1 | 3 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2217 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 3 |
| 3rd row | 4 |
| 4th row | 4 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 1437 | |
| 4 | 564 | 25.4% |
| 5 | 190 | 8.6% |
| 2 | 23 | 1.0% |
| 1 | 3 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3 | 1437 | |
| 4 | 564 | 25.4% |
| 5 | 190 | 8.6% |
| 2 | 23 | 1.0% |
| 1 | 3 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 1437 | |
| 4 | 564 | 25.4% |
| 5 | 190 | 8.6% |
| 2 | 23 | 1.0% |
| 1 | 3 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2217 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 1437 | |
| 4 | 564 | 25.4% |
| 5 | 190 | 8.6% |
| 2 | 23 | 1.0% |
| 1 | 3 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2217 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 1437 | |
| 4 | 564 | 25.4% |
| 5 | 190 | 8.6% |
| 2 | 23 | 1.0% |
| 1 | 3 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2217 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 1437 | |
| 4 | 564 | 25.4% |
| 5 | 190 | 8.6% |
| 2 | 23 | 1.0% |
| 1 | 3 | 0.1% |
grade
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.6481732 |
| Minimum | 4 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.4 KiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 7 |
| median | 7 |
| Q3 | 8 |
| 95-th percentile | 10 |
| Maximum | 12 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.1509639 |
|---|---|
| Coefficient of variation (CV) | 0.15048874 |
| Kurtosis | 1.1697504 |
| Mean | 7.6481732 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.83093809 |
| Sum | 16956 |
| Variance | 1.324718 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 933 | |
| 8 | 653 | |
| 9 | 238 | 10.7% |
| 6 | 204 | 9.2% |
| 10 | 111 | 5.0% |
| 11 | 47 | 2.1% |
| 5 | 20 | 0.9% |
| 12 | 8 | 0.4% |
| 4 | 3 | 0.1% |
| Value | Count | Frequency (%) |
| 4 | 3 | 0.1% |
| 5 | 20 | 0.9% |
| 6 | 204 | 9.2% |
| 7 | 933 | |
| 8 | 653 | |
| 9 | 238 | 10.7% |
| 10 | 111 | 5.0% |
| 11 | 47 | 2.1% |
| 12 | 8 | 0.4% |
| Value | Count | Frequency (%) |
| 12 | 8 | 0.4% |
| 11 | 47 | 2.1% |
| 10 | 111 | 5.0% |
| 9 | 238 | 10.7% |
| 8 | 653 | |
| 7 | 933 | |
| 6 | 204 | 9.2% |
| 5 | 20 | 0.9% |
| 4 | 3 | 0.1% |
sqft_above
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 403 |
|---|---|
| Distinct (%) | 18.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1791.4312 |
| Minimum | 420 |
|---|---|
| Maximum | 7850 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.4 KiB |
Quantile statistics
| Minimum | 420 |
|---|---|
| 5-th percentile | 840 |
| Q1 | 1200 |
| median | 1560 |
| Q3 | 2220 |
| 95-th percentile | 3390 |
| Maximum | 7850 |
| Range | 7430 |
| Interquartile range (IQR) | 1020 |
Descriptive statistics
| Standard deviation | 836.47749 |
|---|---|
| Coefficient of variation (CV) | 0.46693252 |
| Kurtosis | 3.4746521 |
| Mean | 1791.4312 |
| Median Absolute Deviation (MAD) | 450 |
| Skewness | 1.5060442 |
| Sum | 3971603 |
| Variance | 699694.59 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1010 | 28 | 1.3% |
| 1180 | 27 | 1.2% |
| 1200 | 26 | 1.2% |
| 1250 | 23 | 1.0% |
| 1390 | 22 | 1.0% |
| 1510 | 21 | 0.9% |
| 1320 | 21 | 0.9% |
| 1220 | 21 | 0.9% |
| 1060 | 20 | 0.9% |
| 1560 | 19 | 0.9% |
| Other values (393) | 1989 |
| Value | Count | Frequency (%) |
| 420 | 1 | < 0.1% |
| 550 | 1 | < 0.1% |
| 560 | 2 | |
| 570 | 1 | < 0.1% |
| 580 | 3 | |
| 600 | 1 | < 0.1% |
| 610 | 1 | < 0.1% |
| 620 | 2 | |
| 660 | 1 | < 0.1% |
| 680 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 7850 | 1 | |
| 6380 | 1 | |
| 6085 | 1 | |
| 5770 | 1 | |
| 5710 | 1 | |
| 5670 | 1 | |
| 5480 | 1 | |
| 5450 | 1 | |
| 5250 | 1 | |
| 5020 | 1 |
sqft_basement
Real number (ℝ)
ZEROS 
| Distinct | 170 |
|---|---|
| Distinct (%) | 7.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 282.00857 |
| Minimum | 0 |
|---|---|
| Maximum | 2570 |
| Zeros | 1346 |
| Zeros (%) | 60.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 540 |
| 95-th percentile | 1100 |
| Maximum | 2570 |
| Range | 2570 |
| Interquartile range (IQR) | 540 |
Descriptive statistics
| Standard deviation | 423.9148 |
|---|---|
| Coefficient of variation (CV) | 1.5031983 |
| Kurtosis | 1.8844827 |
| Mean | 282.00857 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.4960736 |
| Sum | 625213 |
| Variance | 179703.76 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1346 | |
| 600 | 25 | 1.1% |
| 400 | 24 | 1.1% |
| 700 | 23 | 1.0% |
| 500 | 23 | 1.0% |
| 800 | 22 | 1.0% |
| 300 | 17 | 0.8% |
| 750 | 16 | 0.7% |
| 900 | 14 | 0.6% |
| 1000 | 13 | 0.6% |
| Other values (160) | 694 |
| Value | Count | Frequency (%) |
| 0 | 1346 | |
| 40 | 1 | < 0.1% |
| 50 | 3 | 0.1% |
| 80 | 2 | 0.1% |
| 90 | 3 | 0.1% |
| 100 | 3 | 0.1% |
| 110 | 1 | < 0.1% |
| 120 | 1 | < 0.1% |
| 130 | 4 | 0.2% |
| 140 | 7 | 0.3% |
| Value | Count | Frequency (%) |
| 2570 | 1 | |
| 2250 | 1 | |
| 2240 | 1 | |
| 2220 | 1 | |
| 2150 | 2 | |
| 2060 | 1 | |
| 2020 | 1 | |
| 1900 | 1 | |
| 1870 | 1 | |
| 1852 | 1 |
yr_built
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 116 |
|---|---|
| Distinct (%) | 5.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1971.0465 |
| Minimum | 1900 |
|---|---|
| Maximum | 2015 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.4 KiB |
Quantile statistics
| Minimum | 1900 |
|---|---|
| 5-th percentile | 1915 |
| Q1 | 1951 |
| median | 1975 |
| Q3 | 1997 |
| 95-th percentile | 2012 |
| Maximum | 2015 |
| Range | 115 |
| Interquartile range (IQR) | 46 |
Descriptive statistics
| Standard deviation | 29.505233 |
|---|---|
| Coefficient of variation (CV) | 0.014969324 |
| Kurtosis | -0.69137063 |
| Mean | 1971.0465 |
| Median Absolute Deviation (MAD) | 23 |
| Skewness | -0.44484625 |
| Sum | 4369810 |
| Variance | 870.55876 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2014 | 69 | 3.1% |
| 2006 | 55 | 2.5% |
| 2003 | 52 | 2.3% |
| 2004 | 47 | 2.1% |
| 2007 | 44 | 2.0% |
| 1977 | 42 | 1.9% |
| 1967 | 42 | 1.9% |
| 2008 | 41 | 1.8% |
| 1978 | 39 | 1.8% |
| 1959 | 36 | 1.6% |
| Other values (106) | 1750 |
| Value | Count | Frequency (%) |
| 1900 | 8 | |
| 1901 | 3 | 0.1% |
| 1902 | 5 | |
| 1903 | 3 | 0.1% |
| 1904 | 1 | < 0.1% |
| 1905 | 5 | |
| 1906 | 12 | |
| 1907 | 9 | |
| 1908 | 9 | |
| 1909 | 6 |
| Value | Count | Frequency (%) |
| 2015 | 4 | 0.2% |
| 2014 | 69 | |
| 2013 | 25 | 1.1% |
| 2012 | 18 | 0.8% |
| 2011 | 14 | 0.6% |
| 2010 | 10 | 0.5% |
| 2009 | 22 | 1.0% |
| 2008 | 41 | |
| 2007 | 44 | |
| 2006 | 55 |
yr_renovated
Real number (ℝ)
ZEROS 
| Distinct | 49 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 86.330627 |
| Minimum | 0 |
|---|---|
| Maximum | 2015 |
| Zeros | 2121 |
| Zeros (%) | 95.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 2015 |
| Range | 2015 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 405.89326 |
|---|---|
| Coefficient of variation (CV) | 4.7016138 |
| Kurtosis | 18.188604 |
| Mean | 86.330627 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.4911723 |
| Sum | 191395 |
| Variance | 164749.34 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2121 | |
| 2005 | 7 | 0.3% |
| 2000 | 6 | 0.3% |
| 2010 | 5 | 0.2% |
| 2013 | 4 | 0.2% |
| 2003 | 4 | 0.2% |
| 1989 | 3 | 0.1% |
| 2014 | 3 | 0.1% |
| 1999 | 3 | 0.1% |
| 2015 | 3 | 0.1% |
| Other values (39) | 58 | 2.6% |
| Value | Count | Frequency (%) |
| 0 | 2121 | |
| 1951 | 1 | < 0.1% |
| 1953 | 1 | < 0.1% |
| 1956 | 1 | < 0.1% |
| 1958 | 1 | < 0.1% |
| 1960 | 1 | < 0.1% |
| 1965 | 1 | < 0.1% |
| 1968 | 1 | < 0.1% |
| 1969 | 1 | < 0.1% |
| 1970 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| 2015 | 3 | |
| 2014 | 3 | |
| 2013 | 4 | |
| 2012 | 1 | < 0.1% |
| 2010 | 5 | |
| 2009 | 2 | 0.1% |
| 2008 | 1 | < 0.1% |
| 2007 | 1 | < 0.1% |
| 2006 | 1 | < 0.1% |
| 2005 | 7 |
zipcode
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 70 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 98079.107 |
| Minimum | 98001 |
|---|---|
| Maximum | 98199 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.4 KiB |
Quantile statistics
| Minimum | 98001 |
|---|---|
| 5-th percentile | 98004 |
| Q1 | 98033 |
| median | 98070 |
| Q3 | 98118 |
| 95-th percentile | 98177 |
| Maximum | 98199 |
| Range | 198 |
| Interquartile range (IQR) | 85 |
Descriptive statistics
| Standard deviation | 52.95195 |
|---|---|
| Coefficient of variation (CV) | 0.00053989022 |
| Kurtosis | -0.85409519 |
| Mean | 98079.107 |
| Median Absolute Deviation (MAD) | 42 |
| Skewness | 0.3853781 |
| Sum | 2.1744138 × 108 |
| Variance | 2803.909 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 98038 | 70 | 3.2% |
| 98117 | 65 | 2.9% |
| 98052 | 62 | 2.8% |
| 98059 | 60 | 2.7% |
| 98115 | 60 | 2.7% |
| 98042 | 59 | 2.7% |
| 98103 | 57 | 2.6% |
| 98118 | 56 | 2.5% |
| 98058 | 51 | 2.3% |
| 98023 | 50 | 2.3% |
| Other values (60) | 1627 |
| Value | Count | Frequency (%) |
| 98001 | 32 | |
| 98002 | 24 | |
| 98003 | 28 | |
| 98004 | 29 | |
| 98005 | 16 | 0.7% |
| 98006 | 44 | |
| 98007 | 12 | 0.5% |
| 98008 | 30 | |
| 98010 | 6 | 0.3% |
| 98011 | 15 | 0.7% |
| Value | Count | Frequency (%) |
| 98199 | 31 | |
| 98198 | 26 | |
| 98188 | 15 | 0.7% |
| 98178 | 25 | |
| 98177 | 27 | |
| 98168 | 39 | |
| 98166 | 26 | |
| 98155 | 44 | |
| 98148 | 12 | 0.5% |
| 98146 | 25 |
lat
Real number (ℝ)
| Distinct | 1752 |
|---|---|
| Distinct (%) | 79.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 47.557274 |
| Minimum | 47.1942 |
|---|---|
| Maximum | 47.7775 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.4 KiB |
Quantile statistics
| Minimum | 47.1942 |
|---|---|
| 5-th percentile | 47.31438 |
| Q1 | 47.4698 |
| median | 47.567 |
| Q3 | 47.6745 |
| 95-th percentile | 47.74536 |
| Maximum | 47.7775 |
| Range | 0.5833 |
| Interquartile range (IQR) | 0.2047 |
Descriptive statistics
| Standard deviation | 0.13614404 |
|---|---|
| Coefficient of variation (CV) | 0.0028627385 |
| Kurtosis | -0.72368939 |
| Mean | 47.557274 |
| Median Absolute Deviation (MAD) | 0.1044 |
| Skewness | -0.44161142 |
| Sum | 105434.48 |
| Variance | 0.018535199 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 47.5666 | 5 | 0.2% |
| 47.6681 | 4 | 0.2% |
| 47.6513 | 4 | 0.2% |
| 47.5265 | 4 | 0.2% |
| 47.567 | 4 | 0.2% |
| 47.5634 | 4 | 0.2% |
| 47.6993 | 4 | 0.2% |
| 47.6988 | 4 | 0.2% |
| 47.3338 | 4 | 0.2% |
| 47.5443 | 4 | 0.2% |
| Other values (1742) | 2176 |
| Value | Count | Frequency (%) |
| 47.1942 | 1 | |
| 47.1947 | 2 | |
| 47.1948 | 1 | |
| 47.1983 | 1 | |
| 47.2016 | 1 | |
| 47.2026 | 1 | |
| 47.2048 | 1 | |
| 47.2058 | 1 | |
| 47.2068 | 1 | |
| 47.2082 | 1 |
| Value | Count | Frequency (%) |
| 47.7775 | 1 | |
| 47.7769 | 2 | |
| 47.7762 | 1 | |
| 47.7757 | 2 | |
| 47.7756 | 1 | |
| 47.7751 | 2 | |
| 47.7744 | 1 | |
| 47.7743 | 1 | |
| 47.7742 | 1 | |
| 47.774 | 1 |
long
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 511 |
|---|---|
| Distinct (%) | 23.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -122.21522 |
| Minimum | -122.511 |
|---|---|
| Maximum | -121.352 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 2217 |
| Negative (%) | 100.0% |
| Memory size | 17.4 KiB |
Quantile statistics
| Minimum | -122.511 |
|---|---|
| 5-th percentile | -122.387 |
| Q1 | -122.329 |
| median | -122.235 |
| Q3 | -122.127 |
| 95-th percentile | -121.9816 |
| Maximum | -121.352 |
| Range | 1.159 |
| Interquartile range (IQR) | 0.202 |
Descriptive statistics
| Standard deviation | 0.14079072 |
|---|---|
| Coefficient of variation (CV) | -0.0011519901 |
| Kurtosis | 0.83088139 |
| Mean | -122.21522 |
| Median Absolute Deviation (MAD) | 0.1 |
| Skewness | 0.86558437 |
| Sum | -270951.14 |
| Variance | 0.019822027 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -122.304 | 18 | 0.8% |
| -122.35 | 15 | 0.7% |
| -122.285 | 14 | 0.6% |
| -122.298 | 13 | 0.6% |
| -122.391 | 13 | 0.6% |
| -122.352 | 13 | 0.6% |
| -122.291 | 13 | 0.6% |
| -122.376 | 12 | 0.5% |
| -122.287 | 12 | 0.5% |
| -122.289 | 12 | 0.5% |
| Other values (501) | 2082 |
| Value | Count | Frequency (%) |
| -122.511 | 2 | |
| -122.509 | 1 | |
| -122.497 | 1 | |
| -122.484 | 1 | |
| -122.482 | 1 | |
| -122.474 | 1 | |
| -122.463 | 1 | |
| -122.462 | 1 | |
| -122.448 | 1 | |
| -122.44 | 1 |
| Value | Count | Frequency (%) |
| -121.352 | 1 | |
| -121.707 | 1 | |
| -121.709 | 1 | |
| -121.714 | 1 | |
| -121.718 | 1 | |
| -121.735 | 1 | |
| -121.738 | 1 | |
| -121.745 | 1 | |
| -121.746 | 1 | |
| -121.747 | 1 |
sqft_living15
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 363 |
|---|---|
| Distinct (%) | 16.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1985.8751 |
| Minimum | 399 |
|---|---|
| Maximum | 6210 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.4 KiB |
Quantile statistics
| Minimum | 399 |
|---|---|
| 5-th percentile | 1140 |
| Q1 | 1490 |
| median | 1830 |
| Q3 | 2370 |
| 95-th percentile | 3282 |
| Maximum | 6210 |
| Range | 5811 |
| Interquartile range (IQR) | 880 |
Descriptive statistics
| Standard deviation | 686.14912 |
|---|---|
| Coefficient of variation (CV) | 0.34551475 |
| Kurtosis | 2.116041 |
| Mean | 1985.8751 |
| Median Absolute Deviation (MAD) | 410 |
| Skewness | 1.1845908 |
| Sum | 4402685 |
| Variance | 470800.61 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1420 | 23 | 1.0% |
| 1540 | 23 | 1.0% |
| 1800 | 22 | 1.0% |
| 1500 | 22 | 1.0% |
| 1530 | 22 | 1.0% |
| 1770 | 21 | 0.9% |
| 1300 | 21 | 0.9% |
| 1640 | 20 | 0.9% |
| 1390 | 20 | 0.9% |
| 1690 | 20 | 0.9% |
| Other values (353) | 2003 |
| Value | Count | Frequency (%) |
| 399 | 1 | < 0.1% |
| 620 | 1 | < 0.1% |
| 750 | 1 | < 0.1% |
| 780 | 1 | < 0.1% |
| 830 | 2 | |
| 840 | 2 | |
| 860 | 3 | |
| 870 | 1 | < 0.1% |
| 880 | 1 | < 0.1% |
| 900 | 4 |
| Value | Count | Frequency (%) |
| 6210 | 1 | |
| 5600 | 1 | |
| 5330 | 1 | |
| 4850 | 1 | |
| 4830 | 1 | |
| 4760 | 1 | |
| 4620 | 2 | |
| 4560 | 1 | |
| 4480 | 1 | |
| 4470 | 1 |
sqft_lot15
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 1582 |
|---|---|
| Distinct (%) | 71.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12147.815 |
| Minimum | 755 |
|---|---|
| Maximum | 292645 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.4 KiB |
Quantile statistics
| Minimum | 755 |
|---|---|
| 5-th percentile | 2268.4 |
| Q1 | 5078 |
| median | 7551 |
| Q3 | 10000 |
| 95-th percentile | 36430.8 |
| Maximum | 292645 |
| Range | 291890 |
| Interquartile range (IQR) | 4922 |
Descriptive statistics
| Standard deviation | 22904.987 |
|---|---|
| Coefficient of variation (CV) | 1.8855232 |
| Kurtosis | 56.973284 |
| Mean | 12147.815 |
| Median Absolute Deviation (MAD) | 2463 |
| Skewness | 6.9473654 |
| Sum | 26931706 |
| Variance | 5.2463841 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5000 | 46 | 2.1% |
| 4000 | 43 | 1.9% |
| 6000 | 29 | 1.3% |
| 7200 | 23 | 1.0% |
| 7800 | 14 | 0.6% |
| 7350 | 13 | 0.6% |
| 9600 | 11 | 0.5% |
| 4080 | 11 | 0.5% |
| 7500 | 11 | 0.5% |
| 5500 | 11 | 0.5% |
| Other values (1572) | 2005 |
| Value | Count | Frequency (%) |
| 755 | 1 | |
| 824 | 1 | |
| 886 | 1 | |
| 942 | 2 | |
| 955 | 2 | |
| 1003 | 1 | |
| 1007 | 1 | |
| 1026 | 1 | |
| 1062 | 1 | |
| 1079 | 1 |
| Value | Count | Frequency (%) |
| 292645 | 1 | < 0.1% |
| 275299 | 1 | < 0.1% |
| 220849 | 1 | < 0.1% |
| 217800 | 3 | |
| 212137 | 1 | < 0.1% |
| 211404 | 1 | < 0.1% |
| 209959 | 1 | < 0.1% |
| 207781 | 1 | < 0.1% |
| 202554 | 1 | < 0.1% |
| 199504 | 1 | < 0.1% |
| bathrooms | bedrooms | condition | floors | grade | id | lat | long | price | sqft_above | sqft_basement | sqft_living | sqft_living15 | sqft_lot | sqft_lot15 | view | waterfront | yr_built | yr_renovated | zipcode | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| bathrooms | 1.000 | 0.522 | 0.124 | 0.561 | 0.672 | 0.043 | 0.012 | 0.270 | 0.488 | 0.691 | 0.158 | 0.745 | 0.568 | 0.041 | 0.044 | 0.143 | 0.052 | 0.606 | 0.022 | -0.213 |
| bedrooms | 0.522 | 1.000 | 0.066 | 0.254 | 0.405 | 0.038 | -0.000 | 0.216 | 0.348 | 0.549 | 0.179 | 0.643 | 0.444 | 0.214 | 0.206 | 0.030 | 0.076 | 0.218 | -0.012 | -0.186 |
| condition | 0.124 | 0.066 | 1.000 | -0.307 | -0.201 | -0.018 | 0.010 | -0.109 | 0.016 | -0.190 | 0.155 | -0.100 | -0.105 | 0.110 | 0.111 | 0.012 | 0.000 | -0.387 | -0.072 | -0.002 |
| floors | 0.561 | 0.254 | -0.307 | 1.000 | 0.513 | 0.049 | 0.025 | 0.166 | 0.314 | 0.610 | -0.284 | 0.419 | 0.323 | -0.237 | -0.218 | 0.031 | 0.000 | 0.563 | 0.015 | -0.082 |
| grade | 0.672 | 0.405 | -0.201 | 0.513 | 1.000 | 0.046 | 0.094 | 0.223 | 0.626 | 0.716 | 0.074 | 0.726 | 0.664 | 0.114 | 0.135 | 0.146 | 0.120 | 0.530 | 0.007 | -0.191 |
| id | 0.043 | 0.038 | -0.018 | 0.049 | 0.046 | 1.000 | 0.000 | 0.059 | 0.023 | 0.041 | -0.029 | 0.023 | 0.034 | -0.120 | -0.101 | 0.014 | 0.053 | 0.034 | 0.014 | -0.028 |
| lat | 0.012 | -0.000 | 0.010 | 0.025 | 0.094 | 0.000 | 1.000 | -0.171 | 0.494 | -0.013 | 0.136 | 0.048 | 0.040 | -0.115 | -0.116 | 0.070 | 0.000 | -0.143 | 0.022 | 0.257 |
| long | 0.270 | 0.216 | -0.109 | 0.166 | 0.223 | 0.059 | -0.171 | 1.000 | 0.030 | 0.373 | -0.223 | 0.255 | 0.373 | 0.369 | 0.371 | 0.099 | 0.161 | 0.421 | -0.087 | -0.580 |
| price | 0.488 | 0.348 | 0.016 | 0.314 | 0.626 | 0.023 | 0.494 | 0.030 | 1.000 | 0.524 | 0.254 | 0.639 | 0.563 | 0.040 | 0.043 | 0.227 | 0.342 | 0.108 | 0.114 | 0.015 |
| sqft_above | 0.691 | 0.549 | -0.190 | 0.610 | 0.716 | 0.041 | -0.013 | 0.373 | 0.524 | 1.000 | -0.200 | 0.842 | 0.693 | 0.249 | 0.245 | 0.088 | 0.130 | 0.490 | 0.040 | -0.276 |
| sqft_basement | 0.158 | 0.179 | 0.155 | -0.284 | 0.074 | -0.029 | 0.136 | -0.223 | 0.254 | -0.200 | 1.000 | 0.297 | 0.098 | -0.010 | -0.019 | 0.186 | 0.079 | -0.200 | 0.058 | 0.145 |
| sqft_living | 0.745 | 0.643 | -0.100 | 0.419 | 0.726 | 0.023 | 0.048 | 0.255 | 0.639 | 0.842 | 0.297 | 1.000 | 0.734 | 0.259 | 0.249 | 0.164 | 0.090 | 0.366 | 0.058 | -0.193 |
| sqft_living15 | 0.568 | 0.444 | -0.105 | 0.323 | 0.664 | 0.034 | 0.040 | 0.373 | 0.563 | 0.693 | 0.098 | 0.734 | 1.000 | 0.321 | 0.340 | 0.121 | 0.091 | 0.349 | 0.004 | -0.297 |
| sqft_lot | 0.041 | 0.214 | 0.110 | -0.237 | 0.114 | -0.120 | -0.115 | 0.369 | 0.040 | 0.249 | -0.010 | 0.259 | 0.321 | 1.000 | 0.921 | 0.055 | 0.116 | -0.047 | 0.024 | -0.301 |
| sqft_lot15 | 0.044 | 0.206 | 0.111 | -0.218 | 0.135 | -0.101 | -0.116 | 0.371 | 0.043 | 0.245 | -0.019 | 0.249 | 0.340 | 0.921 | 1.000 | 0.089 | 0.176 | -0.015 | 0.015 | -0.304 |
| view | 0.143 | 0.030 | 0.012 | 0.031 | 0.146 | 0.014 | 0.070 | 0.099 | 0.227 | 0.088 | 0.186 | 0.164 | 0.121 | 0.055 | 0.089 | 1.000 | 0.614 | -0.067 | 0.159 | 0.097 |
| waterfront | 0.052 | 0.076 | 0.000 | 0.000 | 0.120 | 0.053 | 0.000 | 0.161 | 0.342 | 0.130 | 0.079 | 0.090 | 0.091 | 0.116 | 0.176 | 0.614 | 1.000 | -0.042 | 0.156 | 0.013 |
| yr_built | 0.606 | 0.218 | -0.387 | 0.563 | 0.530 | 0.034 | -0.143 | 0.421 | 0.108 | 0.490 | -0.200 | 0.366 | 0.349 | -0.047 | -0.015 | -0.067 | -0.042 | 1.000 | -0.220 | -0.337 |
| yr_renovated | 0.022 | -0.012 | -0.072 | 0.015 | 0.007 | 0.014 | 0.022 | -0.087 | 0.114 | 0.040 | 0.058 | 0.058 | 0.004 | 0.024 | 0.015 | 0.159 | 0.156 | -0.220 | 1.000 | 0.092 |
| zipcode | -0.213 | -0.186 | -0.002 | -0.082 | -0.191 | -0.028 | 0.257 | -0.580 | 0.015 | -0.276 | 0.145 | -0.193 | -0.297 | -0.301 | -0.304 | 0.097 | 0.013 | -0.337 | 0.092 | 1.000 |
| id | date | price | bedrooms | bathrooms | sqft_living | sqft_lot | floors | waterfront | view | condition | grade | sqft_above | sqft_basement | yr_built | yr_renovated | zipcode | lat | long | sqft_living15 | sqft_lot15 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 3793500160 | 20150312T000000 | 323000.0 | 3 | 2.50 | 1890 | 6560 | 2.0 | 0 | 0 | 3 | 7 | 1890 | 0 | 2003 | 0 | 98038 | 47.3684 | -122.031 | 2390 | 7570 |
| 1 | 1175000570 | 20150312T000000 | 530000.0 | 5 | 2.00 | 1810 | 4850 | 1.5 | 0 | 0 | 3 | 7 | 1810 | 0 | 1900 | 0 | 98107 | 47.6700 | -122.394 | 1360 | 4850 |
| 2 | 16000397 | 20141205T000000 | 189000.0 | 2 | 1.00 | 1200 | 9850 | 1.0 | 0 | 0 | 4 | 7 | 1200 | 0 | 1921 | 0 | 98002 | 47.3089 | -122.210 | 1060 | 5095 |
| 3 | 461000390 | 20140624T000000 | 687500.0 | 4 | 1.75 | 2330 | 5000 | 1.5 | 0 | 0 | 4 | 7 | 1510 | 820 | 1929 | 0 | 98117 | 47.6823 | -122.368 | 1460 | 5000 |
| 4 | 7895500070 | 20150213T000000 | 240000.0 | 4 | 1.00 | 1220 | 8075 | 1.0 | 0 | 0 | 2 | 7 | 890 | 330 | 1969 | 0 | 98001 | 47.3341 | -122.282 | 1290 | 7800 |
| 5 | 3626039271 | 20150205T000000 | 585000.0 | 2 | 1.75 | 1980 | 8550 | 1.0 | 0 | 0 | 3 | 7 | 990 | 990 | 1981 | 0 | 98117 | 47.6989 | -122.369 | 1480 | 6738 |
| 6 | 1189001180 | 20140603T000000 | 425000.0 | 3 | 2.25 | 1660 | 6000 | 1.0 | 0 | 0 | 3 | 7 | 1110 | 550 | 1979 | 0 | 98122 | 47.6113 | -122.297 | 1440 | 4080 |
| 7 | 7214720075 | 20141212T000000 | 699950.0 | 3 | 2.25 | 2190 | 107593 | 2.0 | 0 | 0 | 4 | 8 | 2190 | 0 | 1983 | 0 | 98077 | 47.7731 | -122.080 | 2570 | 47777 |
| 8 | 1328310370 | 20150402T000000 | 375000.0 | 3 | 2.50 | 2340 | 10005 | 1.0 | 0 | 0 | 4 | 8 | 1460 | 880 | 1978 | 0 | 98058 | 47.4431 | -122.133 | 2250 | 8162 |
| 9 | 4060000240 | 20140623T000000 | 205425.0 | 2 | 1.00 | 880 | 6780 | 1.0 | 0 | 0 | 4 | 6 | 880 | 0 | 1945 | 0 | 98178 | 47.5009 | -122.248 | 1190 | 6780 |
| id | date | price | bedrooms | bathrooms | sqft_living | sqft_lot | floors | waterfront | view | condition | grade | sqft_above | sqft_basement | yr_built | yr_renovated | zipcode | lat | long | sqft_living15 | sqft_lot15 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2207 | 2937300040 | 20141215T000000 | 942990.0 | 4 | 2.50 | 3570 | 6218 | 2.0 | 0 | 0 | 3 | 9 | 3570 | 0 | 2014 | 0 | 98052 | 47.7046 | -122.123 | 3230 | 5972 |
| 2208 | 7430500110 | 20141209T000000 | 1380000.0 | 5 | 3.50 | 5150 | 12230 | 2.0 | 0 | 2 | 3 | 10 | 3700 | 1450 | 2007 | 0 | 98008 | 47.6249 | -122.090 | 2940 | 13462 |
| 2209 | 3304300300 | 20150507T000000 | 579950.0 | 4 | 2.75 | 2460 | 8643 | 2.0 | 0 | 0 | 3 | 9 | 2460 | 0 | 2011 | 0 | 98059 | 47.4828 | -122.133 | 3110 | 8626 |
| 2210 | 6453550090 | 20150505T000000 | 861111.0 | 4 | 2.50 | 3650 | 7090 | 2.0 | 0 | 0 | 3 | 10 | 3650 | 0 | 2008 | 0 | 98074 | 47.6060 | -122.052 | 3860 | 7272 |
| 2211 | 9578060230 | 20140618T000000 | 535000.0 | 4 | 2.50 | 2610 | 4595 | 2.0 | 0 | 0 | 3 | 8 | 2610 | 0 | 2008 | 0 | 98028 | 47.7728 | -122.235 | 2440 | 4588 |
| 2212 | 6669080120 | 20141215T000000 | 405000.0 | 4 | 2.50 | 1980 | 5020 | 2.0 | 0 | 0 | 3 | 7 | 1980 | 0 | 2007 | 0 | 98056 | 47.5147 | -122.190 | 1980 | 5064 |
| 2213 | 2855000110 | 20140808T000000 | 388000.0 | 3 | 2.50 | 2198 | 6222 | 2.0 | 0 | 2 | 3 | 8 | 2198 | 0 | 2010 | 0 | 98198 | 47.3906 | -122.304 | 2198 | 7621 |
| 2214 | 3345700207 | 20150502T000000 | 608500.0 | 4 | 3.50 | 2850 | 5577 | 2.0 | 0 | 0 | 3 | 8 | 1950 | 900 | 2014 | 0 | 98056 | 47.5252 | -122.192 | 2850 | 5708 |
| 2215 | 6056111067 | 20140707T000000 | 230000.0 | 3 | 1.75 | 1140 | 1201 | 2.0 | 0 | 0 | 3 | 8 | 1140 | 0 | 2014 | 0 | 98108 | 47.5637 | -122.295 | 1210 | 1552 |
| 2216 | 2767600688 | 20141113T000000 | 414500.0 | 2 | 1.50 | 1210 | 1278 | 2.0 | 0 | 0 | 3 | 8 | 1020 | 190 | 2007 | 0 | 98117 | 47.6756 | -122.375 | 1210 | 1118 |